Goto

Collaborating Authors

 citizen science


Decoding the fingerprint of a humpback whale

Popular Science

Breakthroughs, discoveries, and DIY tips sent every weekday. It is in these waters that marine mammal ecologist Ari Friedlaender shuts off the inflatable boat's engine and waits. This is the edge of the world--remote, hostile, and stunningly alive. Beneath the hull, the dark sea churns with wonder abound. A humpback whale (Megaptera novaeangliae) emerges, slow, deliberate, and gentle in its curious demeanor, casting a ripple across the surface.


Machine Learning and Citizen Science Approaches for Monitoring the Changing Environment

Zhou, Sulong

arXiv.org Artificial Intelligence

This dissertation will combine new tools and methodologies to answer pressing questions regarding inundation area and hurricane events in complex, heterogeneous changing environments. In addition to remote sensing approaches, citizen science and machine learning are both emerging fields that harness advancing technology to answer environmental management and disaster response questions. Freshwater lakes supply a large amount of inland water resources to sustain local and regional developments. However, some lake systems depend upon great fluctuation in water surface area.


Leveraging Citizen Science for Flood Extent Detection using Machine Learning Benchmark Dataset

Ramasubramanian, Muthukumaran, Gurung, Iksha, Gahlot, Shubhankar, Hänsch, Ronny, Molthan, Andrew L., Maskey, Manil

arXiv.org Artificial Intelligence

Accurate detection of inundated water extents during flooding events is crucial in emergency response decisions and aids in recovery efforts. Satellite Remote Sensing data provides a global framework for detecting flooding extents. Specifically, Sentinel-1 C-Band Synthetic Aperture Radar (SAR) imagery has proven to be useful in detecting water bodies due to low backscatter of water features in both co-polarized and cross-polarized SAR imagery. However, increased backscatter can be observed in certain flooded regions such as presence of infrastructure and trees - rendering simple methods such as pixel intensity thresholding and time-series differencing inadequate. Machine Learning techniques has been leveraged to precisely capture flood extents in flooded areas with bumps in backscatter but needs high amounts of labelled data to work desirably. Hence, we created a labeled known water body extent and flooded area extents during known flooding events covering about 36,000 sq. kilometers of regions within mainland U.S and Bangladesh. Further, We also leveraged citizen science by open-sourcing the dataset and hosting an open competition based on the dataset to rapidly prototype flood extent detection using community generated models. In this paper we present the information about the dataset, the data processing pipeline, a baseline model and the details about the competition, along with discussion on winning approaches. We believe the dataset adds to already existing datasets based on Sentinel-1C SAR data and leads to more robust modeling of flood extents. We also hope the results from the competition pushes the research in flood extent detection further.


Employing Crowdsourcing for Enriching a Music Knowledge Base in Higher Education

Lyberatos, Vassilis, Kantarelis, Spyridon, Kaldeli, Eirini, Bekiaris, Spyros, Tzortzis, Panagiotis, Mastromichalakis, Orfeas Menis -, Stamou, Giorgos

arXiv.org Artificial Intelligence

This paper describes the methodology followed and the lessons learned from employing crowdsourcing techniques as part of a homework assignment involving higher education students of computer science. Making use of a platform that supports crowdsourcing in the cultural heritage domain students were solicited to enrich the metadata associated with a selection of music tracks. The results of the campaign were further analyzed and exploited by students through the use of semantic web technologies. In total, 98 students participated in the campaign, contributing more than 6400 annotations concerning 854 tracks. The process also led to the creation of an openly available annotated dataset, which can be useful for machine learning models for music tagging. The campaign's results and the comments gathered through an online survey enable us to draw some useful insights about the benefits and challenges of integrating crowdsourcing into computer science curricula and how this can enhance students' engagement in the learning process.


Lessons Learned from a Citizen Science Project for Natural Language Processing

Klie, Jan-Christoph, Lee, Ji-Ung, Stowe, Kevin, Şahin, Gözde Gül, Moosavi, Nafise Sadat, Bates, Luke, Petrak, Dominic, de Castilho, Richard Eckart, Gurevych, Iryna

arXiv.org Artificial Intelligence

Many Natural Language Processing (NLP) systems use annotated corpora for training and evaluation. However, labeled data is often costly to obtain and scaling annotation projects is difficult, which is why annotation tasks are often outsourced to paid crowdworkers. Citizen Science is an alternative to crowdsourcing that is relatively unexplored in the context of NLP. To investigate whether and how well Citizen Science can be applied in this setting, we conduct an exploratory study into engaging different groups of volunteers in Citizen Science for NLP by re-annotating parts of a pre-existing crowdsourced dataset. Our results show that this can yield high-quality annotations and attract motivated volunteers, but also requires considering factors such as scalability, participation over time, and legal and ethical issues. We summarize lessons learned in the form of guidelines and provide our code and data to aid future work on Citizen Science.


How you can contribute to scientific discoveries from your couch

PBS NewsHour

When you picture a scientist, do you see a white coat-clad PhD-holder pipetting away at a lab bench? Or maybe a skygazer with a different day job who goes out on clear nights for a good view of the stars? Historically speaking, both of those examples fit the bill. German-British astronomer William Herschel was originally an amateur who observed the night sky using homemade telescopes. He discovered Uranus in 1781, working alongside his sister, Caroline Herschel, who made multiple discoveries herself.


Analyzing social media with crowdsourcing in Crowd4SDG

Bono, Carlo, Mülâyim, Mehmet Oğuz, Cappiello, Cinzia, Carman, Mark, Cerquides, Jesus, Fernandez-Marquez, Jose Luis, Mondardini, Rosy, Ramalli, Edoardo, Pernici, Barbara

arXiv.org Artificial Intelligence

Social media have the potential to provide timely information about emergency situations and sudden events. However, finding relevant information among millions of posts being posted every day can be difficult, and developing a data analysis project usually requires time and technical skills. This study presents an approach that provides flexible support for analyzing social media, particularly during emergencies. Different use cases in which social media analysis can be adopted are introduced, and the challenges of retrieving information from large sets of posts are discussed. The focus is on analyzing images and text contained in social media posts and a set of automatic data processing tools for filtering, classification, and geolocation of content with a human-in-the-loop approach to support the data analyst. Such support includes both feedback and suggestions to configure automated tools, and crowdsourcing to gather inputs from citizens. The results are validated by discussing three case studies developed within the Crowd4SDG H2020 European project.


Citizen science, supercomputers and AI

#artificialintelligence

Citizen scientists have helped researchers discover new types of galaxies, design drugs to fight COVID-19, and map the bird world. The term describes a range of ways that the public can meaningfully contribute to scientific and engineering research, as well as environmental monitoring. As members of the Computing Community Consortium (CCC) recently argued in a Quadrennial Paper, "Imagine All the People: Citizen Science, Artificial Intelligence, and Computational Research," non-scientists can help advance science by "providing or analyzing data at spatial and temporal resolutions or scales and speeds that otherwise would be impossible given limited staff and resources." Recently, citizen scientists' efforts have found a new purpose: helping researchers develop machine learning models, using labeled data and algorithms, to train a computer to solve a specific task. This approach was pioneered by the crowdsourced astronomy project Galaxy Zoo, which started leveraging citizen scientists in 2007.


Empowering Local Communities Using Artificial Intelligence

Hsu, Yen-Chia, Huang, Ting-Hao 'Kenneth', Verma, Himanshu, Mauri, Andrea, Nourbakhsh, Illah, Bozzon, Alessandro

arXiv.org Artificial Intelligence

Many powerful Artificial Intelligence (AI) techniques have been engineered with the goals of high performance and accuracy. Recently, AI algorithms have been integrated into diverse and real-world applications. It has become an important topic to explore the impact of AI on society from a people-centered perspective. Previous works in citizen science have identified methods of using AI to engage the public in research, such as sustaining participation, verifying data quality, classifying and labeling objects, predicting user interests, and explaining data patterns. These works investigated the challenges regarding how scientists design AI systems for citizens to participate in research projects at a large geographic scale in a generalizable way, such as building applications for citizens globally to participate in completing tasks. In contrast, we are interested in another area that receives significantly less attention: how scientists co-design AI systems "with" local communities to influence a particular geographical region, such as community-based participatory projects. Specifically, this article discusses the challenges of applying AI in Community Citizen Science, a framework to create social impact through community empowerment at an intensely place-based local scale. We provide insights in this under-explored area of focus to connect scientific research closely to social issues and citizen needs.


Revisiting Citizen Science Through the Lens of Hybrid Intelligence

Rafner, Janet, Gajdacz, Miroslav, Kragh, Gitte, Hjorth, Arthur, Gander, Anna, Palfi, Blanka, Berditchevskaia, Aleks, Grey, François, Gal, Kobi, Segal, Avi, Walmsley, Mike, Miller, Josh Aaron, Dellerman, Dominik, Haklay, Muki, Michelucci, Pietro, Sherson, Jacob

arXiv.org Artificial Intelligence

Artificial Intelligence (AI) can augment and sometimes even replace human cognition. Inspired by efforts to value human agency alongside productivity, we discuss the benefits of solving Citizen Science (CS) tasks with Hybrid Intelligence (HI), a synergetic mixture of human and artificial intelligence. Currently there is no clear framework or methodology on how to create such an effective mixture. Due to the unique participant-centered set of values and the abundance of tasks drawing upon both human common sense and complex 21st century skills, we believe that the field of CS offers an invaluable testbed for the development of HI and human-centered AI of the 21st century, while benefiting CS as well. In order to investigate this potential, we first relate CS to adjacent computational disciplines. Then, we demonstrate that CS projects can be grouped according to their potential for HI-enhancement by examining two key dimensions: the level of digitization and the amount of knowledge or experience required for participation. Finally, we propose a framework for types of human-AI interaction in CS based on established criteria of HI. This "HI lens" provides the CS community with an overview of several ways to utilize the combination of AI and human intelligence in their projects. It also allows the AI community to gain ideas on how developing AI in CS projects can further their own field.